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ABSTRACT 


We consider a sensor that is subject to false positive and false negative errors. The 
sensor searches for stationary threat objects such as ballistic missile launchers or 
improvised explosive devices. The objects are located in a certain area of interest, which 
is divided into area-cells. The area-cells are defined such that each of them may contain, 
at most, one threat object. The task of the sensor is to determine if an area-cell contains a 
threat object, and the objective of the searcher is to maximize the number of correctly 
determined area-cells. 

Since definitive identification of a threat object, and subsequent handling of that 
threat, are done by a limited number of available ground combat units, the correct 
determination of an area-cell is cmcial for better allocating and directing these scarce 
resources. We develop an algorithm, rooted in large deviations theory and stochastic 
approximation theory, that leads to the optimal search effort. The computed allocation 
maximizes the expected number correctly determined area-cells as the number of 
available looks for searching becomes large. 
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EXECUTIVE SUMMARY 


Advances in sensing, unmanned aerial vehicles (UAVs), and satellite technologies 
have vastly increased the military use of aerial or space sensors for detecting threat 
objects such as improvised explosive devices or missile launchers. These advanced 
technologies may generate powerful and effective sensors, which necessitate operational 
concepts in order to facilitate their efficient utilization. A typical scenario where such 
operational concepts are needed is related to maritime interdiction missions where a naval 
task force is patrolling a certain area of interest (e.g., Horn of Africa) attempting to detect 
and intercept threat objects (boats) such as terrorists, pirates and weapon smugglers. Such 
threat objects are relatively rare and therefore it is safe to assume that if the area of 
interest is divided into a grid of area-cells of size, say, five square miles, then each area¬ 
cell may contain at most one stationary threat object. The task force employs unmanned 
aerial vehicles, equipped with electro-optical or other sensor for surveillance and search. 
Once a threat object is detected, the task force dispatches an armed inspection unit to 
investigate the threat. Obviously, a false positive detection results in unnecessary 
deployment of the intercepting unit, while a false negative detection may result in severe 
operational consequences such as terrorist or pirate attacks. Arguably, the cost of a false 
positive detection is much smaller than the cost of false negative detection. The objective 
is to minimize the expected total cost. 

In this thesis, we address operational concepts associated with employing sensors 
in persistent search missions over an extended search area. Specifically, we consider the 
problem of efficiently allocating adaptive sensors across a search area of interest. The 
sensors are adaptive in the sense that the search plan is not set in advance, but rather is 
updated in real time during the search process as new information is generated by the 
sensor. 

Assume that one sensor is assigned an area of interest to search, which is 
partitioned into a grid of m area-cells. The sensor operates in glimpses or looks. A look 
may be viewed as a nominal period of time for inspecting a certain area-cell. The sensor 
may spend several looks (i.e., extended inspection time) in a certain area-cell. Each look 
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generates a cue or signal: detection or no-detection. A cue may be correct or erroneous 
(false positive or false negative). Suppose that the sensor has I looks that it can apply to 
the search, and these looks are allocated to the various area-cells dynamically as the 
search mission evolves. 

When the number of looks available I is small, this can be done by solving a 
dynamic program. However, due to the curse of dimensionality [11], the computational 
cost of solving a dynamic program grows exponentially in I m, which precludes its use 
when the number of looks available I is moderately large. This thesis is aimed at the 
situation when I is large. The optimal search effort allocation is determined in two 
stages: First, presuming knowledge of the presence/absence of a threat object in each 
area-cell, use large deviations theory to characterize the optimal effort allocation; and 
second, use adaptive ideas to generate a search sequence that provably converges to the 
optimal effort allocation determined in the first stage. 

Large deviations theory [4] suggests that the probability of incorrectly 
determining an area-cell decays roughly exponentially fast with the number of looks. 
Proposition 1 asserts that the expected number of incorrectly determined area-cells 
decays exponentially fast, with a rate that is the smallest decay rate amongst all area- 
cells. This in turn can be used to obtain the optimal fractional allocations; the exact 
expression is shown in Proposition 2. Unfortunately, the optimal fractional allocations 
require knowledge of the presence or absence of the threat objects, which is what we are 
trying to determine. However, one can take an adaptive approach, motivated by the large 
deviations results, which guarantees optimal allocation of the search resources when the 
number of looks available is large. 

To the best of our knowledge, this is the first thesis that describes the optimal 
effort allocation in the context of target searching along multiple area-cells where the 
sensor has false positive and false negative errors, and that provides an implemental 
adaptive algorithm. We do not consider travel time among the area-cells because, for a 
large number of looks, its affect on the optimal effort allocations is not significant. This 
happens because the sensor could stay in the same area-cell for a large number of looks, 
then move to another area-cell and remain there for a long time, and so on, all the while 



satisfying the optimal search effort allocation. When the number of looks is relatively 
small one should consider resource constraints; see, for example, [13]. 
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I. INTRODUCTION 


A. OPERATIONAL MOTIVATION 

Advances in sensing, unmanned aerial vehicles (UAVs), and satellite technologies 
have vastly increased the military use of aerial or space sensors for detecting threat 
objects such as improvised explosive devices or missile launchers. These advanced 
technologies may generate powerful and effective sensors, which necessitate operational 
concepts in order to facilitate their efficient utilization. A typical scenario where such 
operational concepts are needed is related to maritime interdiction missions where a naval 
task force is patrolling a certain area of interest (e.g., Horn of Africa) attempting to detect 
and intercept threats objects (boats) such as terrorists, pirates and weapon smugglers. 
Such threat objects are relatively rare and therefore it is safe to assume that if the area of 
interest is divided into a grid of area-cells of size, say five square miles, then each area¬ 
cell may contains at most one stationary threat object. The task force employs unmanned 
aerial vehicles, equipped with electro-optical or other sensor for surveillance and search. 
Once a threat object is detected, the task force dispatches an armed inspection unit to 
investigate the threat. Obviously, a false positive detection results in unnecessary 
deployment of the intercepting unit, while a false negative detection may result in serve 
operational consequences such as terrorist or pirate attacks. Arguably, the cost of a false 
positive detection is much smaller than the cost of false negative detection. The objective 
is to minimize the expected total cost. 

In this thesis, we address operational concepts associated with employing sensors 
in persistent search missions over an extended search area. Specifically, we consider the 
problem of efficiently allocating adaptive sensors across a search area of interest. The 
sensors are adaptive in the sense that the search plan is not set in advance, but rather is 
updated in real time during the search process as new information is generated by the 
sensor. 
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B. BACKGROUND 

The theory of optimal search has a history of principal importance in military 
operations. The theory has fundamental applications to anti-submarine warfare, counter¬ 
mine warfare, and search and rescue operations. The books [6] and [15] are classical 
references in this area; with [17] a valuable recent reference. Search problems with 
discrete time and space of the type addressed in this thesis are not new. Optimal 
whereabouts search, where we seek to maximize the probability of determining which 
box contains a certain object, is studied in [1] and [5]. Chew [3] considers an optimal 
search with a stopping mle where all search outcomes are independent, conditional on the 
location of the searched object, and the search policy. Wegener [18] investigates a search 
process where the search time of a cell depends on the number of searches so far. A 
minimum cost search problem is discussed in [12], where only one search mode is 
considered and the sensor has perfect specificity. The paper [14] deals with discrete 
search with multiple sensors in order to maximize the probability of successful search of 
a single threat during a specified time period. Other discrete search problems are studied 
in [2, 10, and 19]. However, all of the aforementioned references assume that the sensor 
has perfect specificity, that is, there are no false positive detections. Our models, which 
are related to [7], relax this assumption. The results we obtain in this thesis rely on the 
large deviations theory [4], and on stochastic approximation theory [8]. 

C. INTRODUCTION 

Assume that one sensor is assigned an area of interest to search, which is 
partitioned into a grid of m area-cells. We assume that the area of interest can be 
partitioned in such a way that each-cell i, for i =1,2,..., m, contains, at most, one threat 
object. The sensor operates in glimpses or looks. A look may be viewed as a normal 
period of time for inspecting a certain area-cell. The sensor may spend several looks (i.e., 
extended inspection time) in a certain area-cell. Each look generates a cue or signal: 
detection or no detection. A cues maybe correct or erroneous (false positive or false 
negative). Suppose that the sensor has I looks that it can apply to the search, and these 
looks are allocated to the various area-cells dynamically as the mission evolves. 
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Let = 1 if area-cell i contains a threat object, and = 0 otherwise, leading to 
the hypothesis Hio : = I, and Hjj : = 0. We suppose that there is some initial 

intelligence about the presence of threat objects, which is manifested by a prior 
probability, 

m = PiC,=\), 

for / = 1, ..., m. This intelligence comes from exogenous sources such as satellite 
imaging, communication interception and human intelligence. The sensor is characterized 
by its sensitivity and specificity. For each area-cell i we have 

Ui = Pi (sensor indicates detection in area-cell i\(^. = 1 ), 
which is called the sensitivity of the sensor; the specificity of the sensor is 1 - where 
bi = Po(sensor indicates detection in area-cell i\i^^ = 0 ). 

Although the a/’s and h/’s may depend on the area-cell, we assume that they do 
not depend on the number of looks. Without loss of generality, we take a, > h,, because 
we can reverse the cue if Ui < b /. We explicitly assume that a, 7 ^ bi, for otherwise, the 
sensor would not provide any valuable information. We assume that the collection of 
search results from the various looks is independent for a given area-cell (meaning that 
there is no systematic bias in the sensor), and that the results for different area-cells are 
independent. 

D. RESEARCH MOTIVATION AND GOAL 

The motivation of this thesis is to characterize effort allocation schemes that 
employ the sensor efficiently, where the measure of effectiveness is expected number of 
correctly determined area-cells. When the number of looks available, I , is small, this 
can be done by solving a dynamic program. However, due to the curse of dimensionality 
[ 11 ], the computational cost of solving a dynamic program grows exponentially in Im, 
which precludes its use when the number of looks available I is moderately large. 

This thesis is aimed at the situation where I is large. The optimal search effort 
allocation is determined in two stages: First, presuming knowledge of the tme status of 
state of nature (i.e., equal to 0 or 1 , for i = 1 , ..., m), use large deviations theory to 
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characterize the optimal effort allocation; and second, use adaptive ideas to generate a 
search sequence that provably converges to the optimal effort allocation determined in 
the first stage. To the best of our knowledge, this is the first thesis that describes the 
optimal effort allocation in the context of threat searching along multiple area-cells where 
the sensor has false positive and false negative errors, and provides an implementable 
adaptive algorithm. We do not consider travel time among the area-cells because, for a 
large number of looks, it affects on the optimal effort allocations is not significant. This 
happens because the sensor could stay in the same area-cell for a large number of looks, 
then move to another area-cell and remain there for a long time, and so on, all the while 
satisfying the optimal search effort allocation. When the number of looks is relatively 
small, one should consider resource constraints; see, for example, [13]. 

The remainder of this thesis is organized as follows. In Chapter II, we discuss 
what we mean by determining an area-cell, using results from the hypotheses testing 
literature. In Chapter III, we determine the optimal search effort allocations, and present 
an adaptive algorithm that results in sampling allocations, which converge almost surely 
to the optimal allocations. In Chapter IV, we present a few examples that illustrate the 
main results. The concluding remarks appear in the last chapter. 
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II. PROBLEM FRAMEWORK 


A GOAL 

A key issue is the determination of whether an area-cell contains a stationary 
threat object or not. Intuitively, the average number of detections in area-cell i should 
approach a, if = 1 and h, if = 0. Since a, > bi, a good decision rule should conclude 
that a threat is present if the average number of detection cues in that area is close to or 
above a,, and that a threat is not present if the average is close to or below bi. Hence, as 
each area-cell is looked at more, it will become less likely to have a wrong determination. 

B. SET-UP 

This problem has been well studied in the statistical literature; see, for example, 
[9]. In this section, we make the relevant ideas precise and set the stage for the main 
results. 

Let the random variable Xq = 1 if the j ’th look into area-cell i is a detection signal, 
and Xij = 0 otherwise, fory = 1, where f Ts the number of looks into area-cell i. 

For each area-cell i, {X^ j\ j = l, is a collection of independent and identically 

distributed random variables with Bernoulli (a,) distribution if there is a threat object, or 
Bernoulli (bi) otherwise. 

A decision test 5* is a sequence of measurable maps with respect to X.^^ ..., X.^ 
from {0,1}^' into {0,1} such that if (x■^,...,x■ ( )= 1 then Hio is accepted, and if 
(x. i,...,x-j ) = 0 then Hifi is rejected. The error probabilities produced by the decision 
test S are 

= PiiSf rejects Hio), (Type I error probability) 

and 

= PiiSf accepts F/,;o), (Type II error probability) 

We define the sample average of the sensor signals, by 
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M 


and let Sl ^ ) = \ if 


r „ A 


\bij 




I-a, 






vl-Ay 


> 1 , 


( 1 ) 


and Sl(Xi^,...,x- f ) = 0 otherwise. In other words, 5*^* (x, ) = 1 if and only if the 

likelihood of having a threat object is greater than the likelihood of not having a threat 
object. 


The Bayes probability of error is given by 7r- + Chemoffs bound 

([4], pp.93) asserts that if 0 <;r;< 1 then 

inf lim inf — log! a, K. + B. (1 - ;r.)] = - 

5 i. ' ' 

where the infimum is taken over all the decision tests, and 

log I 



f ^ 
1/ 


I _ V 

Yi log 

Zl 

+ (1-7;) log 

^ Yi 








ri=- 


yl-agj 


log 


yl-a, b, J 


Several comments are in order. The infimum above is achieved by the decision 
test S] so that S] is optimal in the following sense: among all the decision tests, it 
minimizes the Bayes probability of error in log scale (i.e., it is the decision test with the 
largest Bayes probability of error exponential decay rate), as the number of looks f ^ ^ oo. 

Hence, for the rest of this thesis, we deal with the decisions test S] for each area-cell i = 
1 , ..., m. 


By a straightforward algebraic manipulation, it can be seen that (1) holds if and 
only if > y^. Thus the parameters y^ are determination thresholds, meaning that if 

< 7 ;, then the sensor operator declares that the area-cell does not contain a threat 
object, and if ;) > y^ then the area-cell is declared to contain a threat object. 
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III. IMPLEMENTATION OF ALGORITHM 


A. ALGORITHM APPROACH 

Let 0^ be the fraction of the search budget f that corresponds to area-cell i, so 
that the number of looks into area-cell i is l.=9.l. To make the mathematical 
proceedings less cumbersome, we work with X.{d/) (the average of the observations 
taken over 9 / looks), rather than with J), where [-J is the floor operator. Since 

our results hold for I large, they continue to be tme when the integrality condition is 
enforced, by working with a sequence that goes to infinity. 

Assume, without loss of generality, that area-cells 1,... ,r contain a threat object, 
and that area cells r + I, , m do not contain a threat object. Recall that the goal is to 
allocate the sensor in order to minimize the expected number of incorrectly determined 
area-cell. More precisely, the goal is to 

min g^{9i,...,9J, 

subject to 

m 

i=i 

9X 0 for 1=1,..., m, 

where 

r m 

g,{9„...,9J = Y,P[X,m<y,]+^P{X,m>y,]. ( 2 ) 

/=1 i=r+[ 

Our contribution is two-fold: (i) we characterize fractional allocations 9*,...,9*^ 
that are optimal (in log scale) as f ^ oo; and (ii), we provide an easily implementable 
algorithm, rooted in stochastic approximation theory that results in sampling allocations 
that provably achieve the same performance (in log scale) as the optimal allocations in 
the limit as £ —>■ co. 

Large deviations theory [4] suggests that each of the summands in (2) decays 
roughly decays exponentially fast with the number of looks. The decay rate depends on 
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the large deviations rate function /,(/,) = sup^{ 777 , -log£'exp(; 7 X, j)}, which for a non¬ 


degenerate Bernoulli random variable with mean 77 , is given by 


liiTi) = Yi log 


M. 


+ (1-7;) log 


^1-7^ 




(3) 


vl-/^.v 

for 0 < Yi < 1; see Exercise 2.2.23 of [4]. The next result, whose proof can be found in 
[16], characterizes the decay rate of the expected number of wrongly determined area- 
cells. 


1. Proposition 1 - Decay Rate 

Suppose Ui > bifor each area-cell. Then 

\im-\o^g^{e^,...,ej = -mmeX{Yi)- (4) 

^->•00 ^ i 

Proposition 1 asserts that the expected number of incorrectly determined area 
cells decay exponentially fast, with a rate that is the smallest decay rate amongst all area- 
cells. This suggests that a good allocation should maximize the slowest decay rate, i.e., 
the minimum of OJiiYi) • Th® result, from [16], shows that this approach is optimal 
among all feasible allocations. 


2. Proposition 2 - Optimal Allocation 

Suppose at > bifor each area-cell. Then 

lim —log gAO*, =--- 


where 


0: = 




I w 




For any other feasible {0^,...,0f} we have 


liminf — log eT 6 '|,..., 6 ' )>--, 


(5) 
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The elements 6* as the optimal allocation scheme, meaning that no other 
allocation achieves a higher exponential decay rate for g^(-) as the number of looks goes 
to infinity. Proposition 2 results in 

f ^ 

7; log — +(!-/;) log 

^_ 

r r ^ (]- ^T‘’ 

ZI-i n'^og ^ +(i-r,)iog 

where jii = a, for i = 1, r, and fii = bi for i = r + 1, m. It can be seen from (6) that 
the optimal fractional allocations tend to be large when (i.e., a, if a threat object is 
present, or bi otherwise) is close to the determination threshold . This happens because, 

in that case, the probability of having on “the wrong side” is relatively large, and 
more looks are needed to compensate for the bigger error probability. 

While Proposition 2 characterizes the optimal search allocation, the fractions 0* 

depend on knowledge about the presence / absence of the threat object, which is precisely 
what we are trying to determine. In the next section, we present a stochastic 
approximation algorithm that overcomes this issue and leads to fractional allocations that 
almost surely converge to the optimal 0* allocations. 

B. STOCHASTIC APPROXIMATION ALGORITHM 

First, we present the algorithm, with the intuition behind it following. Initially we 
set 

~ » 

for 0 < X; < 1, and 

r ^ r 

lAri) = +(1-7;)log -— 

Based on prior intelligence, the value x- is our best guess of at stage 0; for 
instance, x- = Ui if 7r> 0.5 and x^ = bi if < 0.5. The rate functions that lead to (6) are 
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estimated by substituting jit by X^Q in (3). The initial stage is ^ = 0 and the initial 
sample sizes are T, „ = 0 . 


1. Algorithm 

1. Generate a replicate ^ from the probability mass function 7r^' / » 

for / = 1, 

2. Update sample sizes: ^ +1, and for . 


3. Generate a sample from area-cell ^, (say) . 


4. Update and : 


A 


iJ+i 


= X,,+- 








and 




= log 


V j 


+ (1-7;) log 


V 1 ^^,e+i) 


For i + ^ , set A. = A,. ,, and /. ^. 

5. Increase 7^7+1 and go to step 1. 

To ensure that each area-cell is searched infinitely often, let (v^)”^o be an 
increasing sequence such that ^ oo and >0» where J(-) is the 

indicator function. We search all m area-cells at iteration Vj,V 2 ,..., and update the 
parameters according to step 2 and 4 of the algorithm. 

In step 1, we decide where to look next. This is accomplished by sampling from 
the probability mass function !'Y^k-Jk \ ■’ which is the best guess of the optimal 

allocation at stage I . In step 3, the searcher generates an observation by sampling from a 
Bernoulli distribution with parameter if area-cell ^ contains a threat object (i.e., ^ 
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e {1, r}), or from a Bernoulli distribution with parameter otherwise. In step 4, we 

update the sample average and sample large deviations rate function of area-cell ^. 

The see why our algorithm leads to the optimal allocations, let 6^ ^ = T,; ^ f be the 
fractional allocation in stage f of the algorithm. Hence, step 2 of the algorithm can be 
expressed as + where is the f’th replicate of ^ 

generated in step 1 of the algorithm. The recursion for 0. can be re-written as 

= ^ij. + ~ ’ 

where 

= 0 - qut ]+^ {qtj. -d-), 

and q.f =I'[l error becomes small relative to {6* -6';^)/(f+ 1) term, 

then 0. ^ follows, as f —> oo, the path of the solution of the ordinary differential equations 

0'.=0*-0.^ 

which has 0* as the unique globally asymptotically stable point. This suggests that if the 
variability introduced by the error is sufficiently small, our algorithm provides fractional 
allocations that converge almost surely to the optimal allocations. The preceding 
argument leads to: 

2. Conjecture 1 - Convergence to Optimal Allocation 

The stochastic approximation algorithm search allocations converge with 
probability one to the optimal allocations determined by (5); 

T.' f t 

almost surely as £^ 00 . 

The proof of this conjecture will appear elsewhere. 

Remark. 0*IXTi) = 011^(7k) suggests that step 2 of the algorithm could be 
replaced by 4" = arg min, J. 
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The preceding can leads to the following conjecture: the expected number of 
incorrectly determined systems area-cells produced by the stochastic approximation 
algorithm approaches 0 at the best possible rate. 

3. Conjecture 2 - Expected Number of Incorrectly Determined Area- 
cells Approaches Zero 

The expected number of incorrect determinations satisfy 

1 r w 1 

-log <r,)+ X > r,) ^, 

Li.i I...I J Lh/i (I'i) 

as i ^ CO. 
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IV. NUMERICAL VALIDATION 


A. INPUT PARAMETERS 

We now illustrate the results of the previous sections via three examples with up 
to 12 area-cells. In the first case, we consider 4 area-cells, with just the first 2 area-cells 
containing a threat object. The “baseline” sensor parameters are a = (0.8, 0.7, 0.6, 0.9), 
and b = (0.3, 0.1, 0.2, 0.2). These parameters lead to the thresholds y = (0.5609, 0.3608, 
0.3869, 0.5803) and to the optimal allocations 9* = (0.2773, 0.1659, 0.4401, 0.1167). 


B. 


ILLUSTRATION OF PROPOSITION I AND 2 


In the first example, we portray the results of propositions 1 and 2. For each area¬ 
cell, we analytically compute the exact probability of incorrect determination, given by 


z 


[w;j 

k 


k /I \^.—k 

a, (I-a,) 


for 1 = 1, ..., r, and by 


I 

k=^eridi J+i 


[w;j 

k 




for 1 = r + 1, The expected number of incorrectly determined area-cells is the sum 

of these two expressions over all the area-cells. This is shown in Figure 1, for three cases: 


Optimal allocation, 9* (cf (5) and (6)); 


• Constant allocation, with 9* =1! m\ 

• Simple allocation, where the determination thresholds are the midpoints between 
Ui and bi, (a^ -bj)/2. If the allocations are inversely proportional to the distance 

Qi - {a^ -4) / 2, the allocation becomes 9^ = -4) * / ) • 


As expected from Proposition 1, the logarithm of the expected number of 
incorrect determinations for the three types of allocation, decays linearly as the number of 
looks increases. Also, in agreement with Proposition 2, the slope of the line 
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corresponding to the optimal allocations is more negative than the slope of the line 
corresponding to the constant and simple allocations. Another take-away from Figure 1, 
which we also observed through numerical experimentation, is that the sub-exponential 
terms appear when computing the expected number of incorrectly determined area-cells 
become unnoticeable for relatively small values of I ; i.e., we get a straight line in Figure 
1 even for i small. This suggests that, although all our results apply in the limits as f ^ 
00 , the transient dies out fairly quickly. 



Figure 1. Exponential decay rate of expected number of wrongly determined area-cells 

for optimal allocation, constant allocation and simple allocation. 

In Figure 2, we illustrate the effect of increasing the number of area-cells for the 
optimal allocation, and of having the difference between a and b being small. 
Specifically, we consider the case a = (0.5, 0.7, 0.6, 0.4) and b = (0.4, 0.55, 0.55, 0.2). 

We plot the case of 4, 8 and 12 area-cells where the parameters and number of 
threat objects for the 8 and 12 area-cells scenario are copies of those for the 4 area-cell 
scenario. As can be seen, the performance decreases with the number of area-cells. This 
can be justified as follows. Recall that the optimal decay rate is 
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Z4‘(/.) ■ 

V4:=l y 

Clearly, the above expression is decreasing in m, meaning that the expected 
number of incorrectly detennined area-cells approaches zero at a slower rate; i.e., we are 
bound to make more erroneous determinations. This is the reason why the slope of the 
lines corresponding to m = 8 and m= 12 becomes closer to zero. 

We observe that the expected number of incorrectly determined area-cells is much 
larger with the current a and b than with the baseline parameters. This illustrates the 
potential operational value of our approach, since now we are working with a much 
larger proportion of incorrectly determined area-cells. 



Figure 2. Exponential decay rate of expected number of wrongly detennined area-cells 

for 4, 8 and 12 area-cells scenario using optimal allocation. 

C. ILLUSTRATION OF CONJECTURE I 

In the last example, we implement the stochastic approximation algorithm, 
without presuming knowledge of the presence or absence of the threat object in each cell. 
The goal is to illustrate conjecture 1. We generate a single replication and plot the 
fractional allocations generated by the adaptive algorithm. Figure 3 illustrates how the 
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Fractional allocations 


allocations approach the optimal allocations (6). As can be seen, the convergence to the 
optimal allocation is already fairly good for 100 looks, and becomes even better as the 
number of looks increases. 



Figure 3. Sample allocations versus optimal allocations. 
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V. CONCLUSIONS 


A. CONCLUDING REMARKS AND FUTURE WORKS 

In this thesis, we developed a model for a single sensor that searches for threat 
objects, when the sensor is subject to false positive and false negative errors. The space is 
discrete, with each area-cell containing at most one stationary threat object. We employed 
the large deviations theory to characterize the optimal search effort allocations, and 
developed a stochastic approximation algorithm. The fractions of time spent in each area¬ 
cell, generated by the algorithm, converge almost surely to the optimal search effort 
allocations. This causes the number of incorrectly determined area-cells to decay at the 
fastest possible exponential rate. Our results were illustrated via 3 numerical examples, 
all of which verified our conjectures. Our methodology is appropriate when the number 
of looks is large; when the number of looks is small, other techniques, such as numerical 
optimization and dynamic programming may be appropriate. 

The models developed in this thesis may be extended in several directions: 
dealing with multiple sensors, consideration of an arbitrary number of targets in each 
area-cell or of a single target in the area of interest, etc. 
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